Inter domain linker region affects properties of CBM6 in GH5_34 arabinoxylanases and alters oligosaccharide product profile

Abstract Understanding the relation between enzyme domain structure and catalytic activity is crucial for optimal engineering of novel enzymes for lignocellulose bioconversion. Xylanases with varying specificities are commonly used to valorise the hemicellulose arabinoxylan (AX), yet characterization of specific arabinoxylanases remain limited. Two homologous GH5_34 arabinoxylanases, HhXyn5A and CtXyn5A, in which the two domains are connected by a 40-residue linker, exhibit distinct activity on AX, yielding different reaction product patterns, despite high sequence identity, conserved active sites and similar domain composition. In this study, the carbohydrate binding module 6 (CBM6), or the inter domain linker together with CBM6, were swapped to investigate their influence on hydrolytic activity and oligosaccharide product pattern on cereal AXs. The variants, with only CBM6 swapped, displayed reduced activity on commercial wheat and rye AX, as well as on extracted oat fibre, compared to the original enzymes. Additionally, exchange of both linker and CBM6 resulted in a reduced ratio of enzyme produced in soluble form in Escherichia coli cultivations, causing loss of activity of both HhXyn5A and CtXyn5A variants. Analysis of oligosaccharide product patterns applying HPAEC–PAD revealed a decreased number of reaction products for CtXyn5A with swapped CBM6, which resembled the product pattern of HhXyn5A. These findings emphasize the importance of the CBM6 interactions with the linker and the catalytic domain for enzyme activity and specificity, and underlines the role of the linker in enzyme structure organisation and product formation, where alterations in linker interactions with the catalytic and/or CBM6 domains, influence enzyme-substrate association and specificity.


Introduction
Glycoside hydrolases (GH) contribute to plant biomass hydrolysis by degrading a diverse range of polysaccharides.In the Carbohydrate Active enZymes (CAZy) database (Drula et al. 2022), numerous GH families are listed.Among these families, GH5 is a large family, widely distributed across archaea, bacteria and eukaryota, hosting enzymes with a variety of specificities, including for example endo-and exo-glucanases, endo-and exomannanases, 1,6-galactanases, xyloglucanases and arabinoxylanases (Drula et al. 2022).This large variation is reflected in the sequences, and has resulted in a GH5 subfamily classification that delineates family members into a number of monospecific and polyspecific clades (Aspeborg et al. 2012).Enzymes specifically targeting xylans can be found in different GH5 subfamilies, where exclusive arabinoxylanases are found in the small GH5_34 subfamily (Correia et al. 2011;Labourel et al. 2016;Norlander et al. 2023).The arabinoxylanases attributed to the GH5_34 subfamily require an α-1,3-Araf substitution in the −1 subsite position in their active site for the catalytic hydrolysis to occur (Correia et al. 2011;Labourel et al. 2016).Their relatively open catalytic cleft allows Araf substitutions in all subsites from −2 to +2 including double substitutions, making them interesting for the production of arabinose substituted products (Correia et al. 2011;Falck et al. 2018;Norlander et al. 2023).Despite being of great interest for the production of arabinoxylo-oligosaccharides, only a few subfamily members have been studied in detail (Correia et al. 2011;Labourel et al. 2016).The most well studied GH5_34 arabinoxylanase CtXyn5A from Acetivibrio thermocellus (initially Clostridium thermocellum) is a multimodular enzyme with three carbohydrate binding modules (CBM6, CBM13, and CBM62), which are suspected to aid in the hydrolysis of complex heteroxylans (Labourel et al. 2016).Attempts to produce the single catalytic module of CtXyn5A have been unsuccessful, while a truncated variant consisting of the catalytic GH5 module and a CBM6 has been successfully produced in Escherichia coli as well as structurally characterized (PDB 2Y8K) (Correia et al. 2011).The novel thermostable GH5_34 subfamily arabinoxylanase HhXyn5A from Herbinix hemicellulosilytica has recently been extensively characterized (Norlander et al. 2023).Analogously to CtXyn5A, HhXyn5A is also a multi-modular enzyme, where the two-domain enzyme variant consisting of catalytic domain and CBM6 is the smallest functional entity.Interestingly, in contrast to the commercially available two-domain CtXyn5A variant, the recently characterised HhXyn5A produces a different oligosaccharide product profile, which may be influenced by the atypical CBM6 that is significantly less conserved than the catalytic GH5 domain (Norlander et al. 2023).
CBM6 modules display the typical β-sandwich fold of the jelly roll superfamily (consisting of two sheets of antiparallel β-strands connected by loops) and is in general found in polysaccharide degrading enzymes from archaea or bacteria (Michel et al. 2009), but has, however, also been reported to be associated with enzymes originating from eukaryotes (Takaki et al. 2002).CBM6 can be found attached to enzymes with varying substrate specificities (e.g.xylanases, cellulases, agarases, laminarinases) (Michel et al. 2009), with reported binding specificities for linear and branched/decorated xylan, β-1,4-glucan (or cellulose), mixed-linked β-1,3-1,4-glucan (or lichenan), agarose, β-1,3-glucan (or laminarin) or chitin.The first characterized CBM6 originated from a multi-modular xylanase from C. thermocellum (Fernandes et al. 1999).Subsequent sequence analyses of other CBM6 structures indicated a large diversity in ligand specificity, and phylogenetic analyses of CBM6 sequences resulted in module family division into subfamilies (Czjzek et al. 2001;Abbott et al. 2009).The ligand binding site of CBM6 has either been located as a shallow cleft on the concave surface of the β-sheets (as observed in CBMs from different families), or more commonly, located within the connecting loops of the two β-sheets (the apex site or site A, in e.g.PDB 1GMM; PDB 1UXX; PDB 1NAE; PDB 1W9W).Moreover, two distinct binding sites were revealed in a CBM6 of lichenase from Cellvibrio mixtus (PDB 1UYY; PDB 1UY0; PDB 1UYZ) that displayed binding affinities for both sites (Henshaw et al. 2004;Pires et al. 2004).In CtXyn5A, the binding site of the CBM6 is located on the loops (binding site A) and is including a pair of aromatic residues, proposed as a central feature of this site (Correia et al. 2011).In contrast, the CBM6 of HhXyn5A is missing these residues, suggesting that binding is mediated by other residues and interactions, or occurring at another site of the module, as proposed by docking simulations (Norlander et al. 2023).
The described difference raised interest in the possibilities to combine the CBM6 and catalytic modules from the two GH5_34 enzymes, to investigate the influence of the exchange on the activity profiles.HhXyn5A and CtXyn5A displayed different oligosaccharide product profiles on arabinoxylan (AX) substrates, and this was suggested to be attributed to the structural differences between CBM6 domains of the enzymes (Norlander et al. 2023).In this study, the CBM6 between HhXyn5A and CtXyn5A were swapped (keeping the original interactions of the catalytic domain and the linker), as well as the linker region together with the CBM6 (keeping the interactions with the original linker and CBM6), to investigate and compare the influence of the CBM6 and the linker on catalytic activity and substrate affinity for these two homologous GH5_34 arabinoxylanases.

Results
Construction, production and purification of HhXyn5A and CtXyn5A enzyme variants Four variants of the two-domain enzymes HhXyn5A and CtXyn5A, consisting of the catalytic GH5_34 domain, inter domain linker region and the CBM6, were constructed from the respective amino acid sequences, with swapped CBM6, or swapped linker region and CBM6 (Fig. 1C).The resulting variants were termed indicating (from N-terminus to Cterminus) catalytic module-linker region-CBM6 for simplicity naming modules and linker region Hh or Ct according to their respective origin.The linker region natively present between the two modules was clearly identified in both CtXyn5A and HhXyn5A via determination of the domain boundaries of the respective enzyme, using a combination of available sequence annotations and tertiary structure comparisons (as described in Materials and methods).The HhXyn5A inter domain linker region constituted a stretch of 41 amino acids (Thr286-Thr326), while the corresponding region in CtXyn5A was 39 amino acids (Gly336-Thr374) (Supplementary Fig. S1).A mutated variant of HhXyn5A was also constructed by changing four residues of the HhCBM6 into the residue found at the corresponding position in CtCBM6 (H376W, D430F, G431Q, K459N), in order to investigate the influence of the exchange on the enzyme activity profile.The four mutated amino acid residues were identified in cleft A of the CtCBM6 as structurally essential for substrate binding and specificity of CtXyn5A (Correia et al. 2011;Norlander et al. 2023).
Oligopeptides of the purified variants Ct-Ct-Hh and Ct-Hh-Hh fragments, identified applying mass spectrometry (MS) analysis, confirmed proteolytic cleavages in the linker region sequence as most likely (Supplementary Fig. S2).Production of the mutated HhXyn5A-MUT variant (HhGH5_34-HhLinker-HhCBM6(H376W, D430F, G431Q, K459N)) (Fig. 1C) resulted in a high production level of recombinant protein purified to near homogeneity applying IMAC (data not shown).The pure HhXyn5A-MUT variant was stable in solution in both elution and formulation buffers.Observed differences in heterologous production of the enzyme variants in the soluble protein fraction and in protease susceptibility between the variants indirectly indicate a structural importance of the interactions between the catalytic domain and the linker region for protein solubility and integrity of the produced enzyme, as both HhXyn5A and Hh-Hh-Ct variants were demonstrated to be produced in the soluble protein fraction in the cultivations (Table 3).

Enzyme variant thermostability
The thermostability of the HhXyn5A and CtXyn5A variants were investigated by measuring melting temperature (T m ) applying nano scale differential scanning fluorimetry (nanoDSF) at pH 6.5 (Table 1).The HhXyn5A was unfolding at approximately 60 • C, whereas CtXyn5A was more thermostable demonstrating a T m of approximately 75 • C. The difference in thermostability between HhXyn5A and CtXyn5A, at the optimal pH 6.5, was in accordance with previous data, where a T m of 61 and 73 • C, respectively, was Table 1.Melting temperatures (T m ) ± standard deviation of the enzyme variants in reaction buffer (10 mM HEPES-HCl pH 6.5 (RT)) determined by the nano scale differential scanning fluorimetry.Measurements were performed in triplicates and T m was extracted from the first derivative of the absorbance ratio 350/330 nm.

enzyme variant
melting temperature (T m , • C) obtained (Norlander et al. 2023).The HhXyn5A-MUT variant, with four amino acid residues mutated in the sequence of HhCBM6, displayed a thermostability similar to HhXyn5A, with a T m slightly lower than 62 • C. The domain-swapped variants Hh-Hh-Ct and Hh-Ct-Ct were more thermostable than HhXyn5A with a T m of 3.3-3.6• C, respectively, indicating stabilisation of the variant structures by including CtCBM6.Inclusion of the HhCBM6 demonstrated a negligible increase in thermostability as the T m of Ct-Ct-Hh and Ct-Hh-Hh increased only by 1.0-1.4• C, respectively, as compared with the thermostability of the original CtXyn5A (Table 1).

Enzyme variant activity and oligosaccharide product profiles
Hydrolytic activity of the enzyme variants on AX substrates was determined as reducing end formation with the dinitrosalicylic acid (DNS) assay, measured after 24 h of incubation at 50 • C (Fig. 2).The reducing end formation was applied to calculate percent of AX degradation of total AX content in the respective substrate that all had an A/X ratio of 0.6.HhXyn5A and CtXyn5A demonstrated comparable activity levels on all substrates (Fig. 2) as previously observed (Norlander et al. 2023).Compared to the two-domain HhXyn5A variant (Fig. 1C) (Norlander et al. 2023), the domain-swapped Hh-Ct-Ct variant, with both linker and CBM6 swapped, displayed significantly lower activity on all substrates tested: wheat arabinoxylan (WAX), rye arabinoxylan (RAX), and alkali extracted oat bran fibre (OBF).Specifically, the Hh-Ct-Ct activity was only in the range of 9%-24% compared to the activity level of HhXyn5A, depending on the substrate used (Fig. 2).The effect on activity was less pronounced for the variant Hh-Hh-Ct, with only CBM6 swapped, but still demonstrated a substrate specific dependence with low activity on OBF.The activity level of the Ct-Hh-Hh variant, again with both linker and CBM6 swapped, was reduced to as little as 4%-10% of CtXyn5A (Fig. 2), depending on the type of substrate tested.On the other hand, the Ct-Ct-Hh variant, with only the CBM6 swapped, demonstrated comparable activity levels to the original CtXyn5A, on all three substrates (Fig. 2).This indicates that the influence of the CBM6-exchange was lower for the variants with the CtGH5_34 catalytic module, while the variants including the HhGH5_34 catalytic module demonstrated a notable reduction in activity by the domain exchange on all substrates.Additionally, the HhXyn5A-MUT variant, with four amino acid residues in cleft A of the HhCBM6 mutated to the residues at the corresponding positions in CtCBM6, demonstrated a similar activity level as the Hh-Hh-Ct variant on WAX and OBF substrates, indicating that that specific interactions were more important than the overall position of the CBM6.
The oligosaccharide products of the enzymatic hydrolysis were analysed applying high performance anion exchange chromatography with pulsed amperometric detector (HPAEC-PAD) (Fig. 3).Both HhXyn5A and CtXyn5A produced broad product spectra from all substrates, generating peaks corresponding to elution times for various AXOS standards.It was also noted that CtXyn5A produced many additional products, seen as multiple unidentified peaks in the chromatograms, compared to the product profile from HhXyn5A.The Ct-Ct-Hh variant produced product spectra similar to HhXyn5A from WAX (Fig. 3A) and RAX (Fig. 3B) substrates, although at lower concentrations.Similar chromatogram peak patterns to HhXyn5A were also observed for the variants Hh-Hh-Ct and Hh-Ct-Ct on WAX and RAX, however at significantly lower concentrations (Supplementary Fig. S3), indicating that the complete set of interactions in CtXyn5A are necessary for the more complex product profile.No oligosaccharide products from any of the three substrates used could be detected applying HPAEC-PAD for the Ct-Hh-Hh variant.The HPAEC-PAD oligosaccharide product profile comparison supports the assumption that loss of substrate hydrolysis efficiency was a consequence of exchange of both the CBM6 and linker region for the two enzymes, especially for HhXyn5A variants including the HhGH5_34 catalytic module.Additionally, it should be noted that HhXyn5A-MUT variant oligosaccharide product pattern was similar to the product profile from the original two-domain HhXyn5A enzyme (Supplementary Fig. S4), and did not change as a consequence of the introduced mutations (while the activity level was more similar to the Hh-Hh-Ct variant).

Enzyme variant binding affinity to WAX
In order to investigate the CBM6 as well as the effect of swapping both linker and CBM6, on enzyme-substrate affinity, affinity gel electrophoresis was performed using WAX as substrate.The migration distances measured for all HhXyn5A and CtXyn5A enzyme variants (Fig. 4 and Table 2) did not differ substantially between the two native PAGE gels, with incorporated WAX and control gel without substrate, indicating very low binding affinity to the substrate for all enzyme variants.The calculated and normalized migration distance ratio (MDR) demonstrated a slight migration retardation and increased binding to WAX for the domain-swapped variants Hh-Hh-Ct (0.90) and Hh-Ct-Ct (0.88), compared to original HhXyn5A (0.94), reflected as lower MDR values.Thus, the CtCBM6 appears to display a somewhat increased binding to the substrate.In contrast, Ct-Ct-Hh and Ct-Hh-Hh variants displayed higher MDR values (1.24 and 1.09, respectively) compared to CtXyn5A (1.05), indicating slightly weaker binding of CtXyn5A domainswapped variants to WAX.

Amino acid interactions in the linker region
To further investigate the potential interactions between the linker region, the catalytic GH5_34 domain and the CBM6, the domain-swapped variants were modelled and compared with CtXyn5A and HhXyn5A.All homology models were created using the three available structures of CtXyn5A (PDB 5LA2; PDB 2Y8K; PDB 5G56) as templates.Created variant structure models displayed stability after a 50 ns molecular dynamics (MD) simulation, with no major conformational changes observed (Fig. 5, RMSD plots shown in Supplementary Fig. S5).Potential polar interactions and πinteractions were identified between the linker region and the CBM6, as well as the catalytic domain for each enzyme variant structure (Supplementary Table S1).Both HhXyn5A domainswapped variants displayed a few more interactions between the linker region and the CBM6 compared to the original HhXyn5A enzyme and the Hh-Ct-Ct variant additionally displayed more interactions between the linker region and the catalytic domain.The total number of differing amino acid interactions between the swapped variants and HhXyn5A was however similar for both Hh-Hh-Ct and Hh-Ct-Ct, either due to loss of interaction or formation of a new interactions, although Hh-Ct-Ct variant exhibited more differences.The root-mean-square fluctuation (RMSF) plot (Fig. 5

Discussion
To investigate the influence of CBM6 on hydrolytic activity and specificity of GH5_34 enzymes, a comparison of   HhXyn5A and CtXyn5A with domain-swapped variants was performed (Table 3).The enzyme domain-swapped variant set was formed after interpreting the identified linker region as a structural element integrated with either the GH5_34 catalytic domain or the CBM6 domain.This assumption was based on the CtXyn5A structure analysis, which was indicating a stable conformation as well as a domain interface interaction network with the linker region (Correia et al. 2011).Production of the Ct-Ct-Hh and Ct-Hh-Hh variants resulted in comparatively low yields and domain splitting, and was negatively influenced by the exchange of the CtCBM6 to HhCBM6.Additionally, the aggregation observed for the Hh-Ct-Ct variant indicates that improper folding may occur as a result of swapping the linker region as well as the CBM6.This indirectly supports the assumption that the linker region of HhXyn5A and CtXyn5A is an important element for proper domain interaction and folding, of importance for storage stability in solution.Influence of the linker region on thermostability in the domain-swapped enzyme variants was, however, not observed.Exchange of the CBM6 domains, however led to improved thermostability in the variants including the more thermostable domain.A previous study on CtXyn5A suggests that the polar and apolar interactions of the linker region, make the catalytic domain and the CBM6 co-dependent (Correia et al. 2011).In the present study, the identified number of amino acid interactions in modelled HhXyn5A and CtXyn5A variant structures did not differ much between the enzymes, but as the linker region is conserved, the interactions in conserved parts can be restored even after a linker exchange.Thus, the conclusion that the linker substantially influences the position of the domains, activity and thermostability of the GH5_34 enzymes is valid also in this study.Ct-Ct-Hh displayed the least differences in interactions, when comparing the original enzyme with the domain-swapped variants, and retained most of the activity.In contrast, Ct-Hh-Hh variant that displayed most differences in linker interactions, demonstrated both protease susceptibility and loss of enzymatic activity.The comparison of RMSF plots indicates that the linker region in Hh-Hh-Ct and Hh-Ct-Ct variants is more stable than in the original HhXyn5A, while the swap of linker region and CBM6 connected to the CtXyn5A catalytic domain led to significantly higher fluctuation in the linker region of the Ct-Hh-Hh variant, that also displayed the lowest activity.
The enzyme variants Hh-Hh-Ct and Ct-Ct-Hh with the linker region connected to the natively corresponding catalytic domain displayed higher activity on AX substrates, than Hh-Ct-Ct and Ct-Hh-Hh variants where the CBM6 and linker region originated from the same enzyme but were not of the same origin as the catalytic module.These constructs demonstrated only limited activity, and when applying HPAEC-PAD no oligosaccharides could be detected.The Ct-Ct-Hh variant displayed high activity, comparable to the original CtXyn5A, however interestingly displayed product spectra which instead were more similar to the product pattern of HhXyn5A.This indicates that the CBM6 in CtXyn5A is at least partially responsible for activity and specificity of the enzyme allowing formation of additional shorter products.This supports the hypothesis in our previous study (Norlander et al. 2023), that the CBM6 is affecting the differences in product pattern.The linker region may, on the other hand, be responsible for the localisation of the binding site of CBM6 in relation to the catalytic GH5_34 domain, which may affect the ability of the CBM6 to act in a productive manner together with the catalytic domain.
The HhXyn5A-MUT variant demonstrated a similar activity level as the Hh-Hh-Ct variant and was notably less active than both the HhXyn5A and CtXyn5A enzymes, while the HhXyn5A-MUT product pattern was similar to HhXyn5A, indicating release of fewer types of oligosaccharides, and the mutations (introduced based on residues in CtCBM6) did hence not improve the number of products released.
The observed limited activity of Hh-Ct-Ct and Ct-Hh-Hh, with no oligosaccharide products detected by applying HPAEC-PAD although showing activity applying the DNS assay, indirectly indicates release of longer oligosaccharides than the maximum size detectable by the chromatographic method.Hydrolysis products, released by GH5_34 arabinoxylanase, and longer than detectable by HPAEC-PAD has been reported and identified applying MS (Rudjito et al. 2023).Overall, the enzyme variants activity results demonstrate the importance of the linker region in hydrolysis, where the inter-domain linker region can play a significant role in positioning the CBM6 in relation to the catalytic site of the catalytic domain, thereby determining the efficiency in oligosaccharide production.
CtCBM6 has previously shown low to no binding affinity to longer xylan substrates, however the domain has been found to be necessary for binding shorter oligosaccharides to the enzyme, specifically by the aromatic residues Trp424 and Phe478 (Correia et al. 2011).In the present study, the CBM6 domains in both HhXyn5A and CtXyn5A enzyme variants showed low binding affinity to WAX as indicated by the minor differences in enzyme migration measured between the affinity electrophoresis gel with incorporated substrate and the control gel.A slight increase in retardation could be observed for Hh-Hh-Ct and Hh-Ct-Ct variants, as well as a decrease in retardation for Ct-Ct-Hh and Ct-Hh-Hh variants, indicating that the HhCBM6 has an even lower affinity to WAX compared to CtCBM6.In our previous study (Norlander et al. 2023), characterisation of HhXyn5A showed that HhCBM6 lacks the important aromatic residues that forms the classical binding cleft for XOS substrates, suggesting that HhXyn5A has lower affinity to XOS compared to CtXyn5A and is possibly less likely to hydrolyse XOS substrates.This assumption is further supported by the lack of the shorter oligosaccharide products from HhXyn5A, which are detected in the CtXyn5A enzyme HPAEC-PAD spectra.The HhXyn5A-MUT variant with mutated HhCBM6, is recreating the typical CBM6 aromatic cleft A of CtXyn5A and the hydrogen bonding that is missing in native HhCBM6, but still demonstrated a similar product pattern to HhXyn5A, which indicates that the aromatic stacking interactions in cleft A are not the sole factor determining protein-ligand interaction and product profile.The analysis and overall results suggest that HhCBM6 may have an effective stabilising function of the catalytic domain or an additional function in a potential cellulosome complex, functioning together with other domains to break down recalcitrant plant material.Although it was previously shown that the HhCBM6 is essential for hydrolytic activity, as the single catalytic domain is inactive (Norlander et al. 2023), it could be suggested that the inter domain linker region may be as important as the CBM6 for substrate association to the catalytic domain in the HhXyn5A arabinoxylanase.

Material and methods
Construction and cloning of HhXyn5A and CtXyn5A enzyme variants Enzyme variants were constructed by combining the catalytic GH5_34 domain with the linker region and CBM6 analogously to the domain organisation of the truncated two-domain HhXyn5A and CtXyn5A variants (Fig. 1C).Sequences of combined enzymes modules originated from GH5_34 arabinoxylanases HhXyn5A-FULL (GenBank NLC19267.1 (Campanaro et al. 2020)) and CtXyn5A-FULL (RefSeq WP_003513669.1 (Wilson et al. 2013)).Initial characterisation of domain organisation (Norlander et al. 2023) through NCBI Conserved Domain Database 3.17 (Lu et al. 2020) was further investigated re-evaluating enzyme sequence module boundaries by comparison of the HhXyn5A tertiary structure model (Norlander et al. 2023) with the CtXyn5A structure (PDB 2Y8K).Protein sequences were aligned defining the amino acid conservation and visualized using Jalview 2.11 (Waterhouse et al. 2009).Sequences encoding signal peptides predicted with the SignalP 6.0 server (Teufel et al. 2022) were omitted.Four constructed domain-swapped enzyme variants Hh-Hh-Ct, Hh-Ct-Ct, Ct-Ct-Hh and Ct-Hh-Hh (Fig. 1C) were cloned in frame with an N-terminal His 6 -tag encoding sequence.Enzyme mutated variant HhXyn5A-MUT was constructed by mutating four amino acid residues (H376W, D430F, G431Q, K459N) in the CBM6 sequence to the residues at the corresponding positions in CBM6 of the CtXyn5A variant (Fig. 1C) and cloned in frame with an N-terminal His 6 -tag encoding sequence.All constructed variant sequences were vectoradapted maintaining the native codon landscape, synthesised and cloned (Bio-Cat, Germany) in pET-21b(+) vectors (Merck).

Production and purification of recombinant enzyme variants
Heterologous overexpression in E. coli inducing with isopropyl β-D-galactopyranoside and purification applying IMAC of all enzyme variants was performed as previously described (Norlander et al. 2023) for HhXyn5A and CtXyn5A variants.The purified proteins were dialyzed at 4 • C through 3,500 Da MWCO regenerated cellulose membranes against formulation buffer (50 mM HEPES-HCl pH 6.5 (RT)) at 1:5000 volume ratio.If domain splitting of recombinant proteins was observed purification was performed using SIGMAFAST Protease Inhibitor Tablets (Merck) for protease activity inhibition.Buffer exchange to formulation buffer, while maintaining protease inhibition was performed applying group separation with HiTrap Desalting 5 mL (16 × 25 mm) columns (Cytiva) loading 10% (v/v) injection volume onto two tandem connected desalting columns at 0.5 mL/min loading flowrate.The commercial CtXyn5A (product number CZ00601) produced in E. coli and purified applying IMAC was purchased from NZYTech.Protein samples were filtered through 0.22 μm pore size regenerated cellulose filters.The protein concentration was determined by measuring absorbance at 280 nm (assuming A 280 1 = 1 mg/mL as well as considering absorption coefficients (Supplementary Table S2) predicted with the ProtParam tool (Gasteiger et al. 2005)) using BioSpec-nano spectrophotometer (Shimadzu).Purity and integrity of proteins were analyzed by visualization applying 4%-15% glycine-SDS-PAGE (Bio-Rad).If domain splitting of recombinant proteins was observed, purified fragments were excised from stained SDS-PAGE gel and analysed applying MS (Bruker Daltonics).

Enzyme stability
Enzyme variant T m was estimated in reaction buffer applying nanoDSF using a Prometheus NT.48 instrument (Nan-oTemper Technologies).Standard grade glass capillaries (Nan-oTemper Technologies) were filled with enzyme solution at a concentration of 0.1-0.2mg/mL.Thermal unfolding was performed with a temperature gradient between 20 and 95 • C at a 1 • C/min ramp rate and adjusting excitation power to 40%.T m was determined from the first derivative of the absorbance ratio 350/330 nm and was identified automatically by the instrument software PR.ThermControl 2.0 (NanoTemper Technologies).

Enzyme activity assay
Enzyme activity was measured by monitoring reducing end formation from different substrates with DNS assay (Miller 1959).Substrates were suspended at 10 mg/mL in reaction buffer (10 mM HEPES-HCl pH 6.5 (RT)) and added to 1.5 mL tubes at 9:1 volume ratio of substrate and enzyme preparation of 0.2 mg/mL.The reactions were incubated in thermoshakers at 50 • C with shacking at 500 rpm for 24 h (Norlander et al. 2023).Samples were collected to be used for DNS assay and oligosaccharide analysis.DNS reagent was added to samples at a 1:1 volume ratio to stop the reaction and the samples were then boiled for 10 min and chilled on ice, before measuring absorbance at 540 nm with Multiskan GO microplate spectrophotometer (Thermo Fisher Scientific).The reducing end formation was calculated based on a Dxylose standard curve and the results were divided with the theoretical reducing end formation of total AX degradation for each substrate, resulting in a hydrolysis percentage of total AX.Samples collected for oligosaccharide analysis were boiled for 10 min, subsequently diluted and filtered through 0.2 μm pore size PTFE filters.Substrates including soluble wheat arabinoxylan (WAX) (low viscosity) and soluble rye arabinoxylan (RAX) were purchased from Megazyme.Alkali soluble oat bran fibres (OBF) were extracted (Schmitz et al. 2022) from an insoluble oat fibre bran fraction obtained from oat processing, provided by Lantmännen Oats (Sweden) in 2020 from their production site oat mill in Kimstad (Sweden).AX content of the extracted oat fibres was determined through acid hydrolysis and quantification of arabinose and xylose through HPAEC-PAD as previously described (Schmitz et al. 2022).AX content of WAX, RAX and OBF were 95, 90, and 5.2%, respectively.All AXOS substrates had a similar A/X ratio of 0.6.

Affinity gel electrophoresis
Affinity gel electrophoresis was performed (Cockburn et al. 2023) in order to investigate the domain swapping effect on enzyme-substrate affinity comparing with determined HhXyn5A and CtXyn5A variant binding affinity to WAX.Native 10% PAGE gels were prepared incorporating 0,1% (w/v) WAX or without substrate.Running buffer (300 mM Tris-HCl, pH 8.8 (RT)) was used for protein separation performing PAGE at constant 10 V/cm.Native PAGE gels were stained with Coomassie Brilliant Blue R-250 Staining Solution (Bio-Rad) and detained using distilled water.Bovine serum albumin fraction (BSA) (Sigma-Aldrich) was used as non-binding reference protein.GelDoc Go imaging system (Bio-Rad) was used for gel documentation.The migration distance for the protein in native PAGE with incorporated WAX was divided with the migration distance in native PAGE control gel without WAX substrate resulting in calculated MDR.

Homology modelling for evaluation of the linker region amino acid interactions
Homology models of domain-swapped enzyme variants were generated using YASARA 21.12.19 program (YASARA Biosciences) in order evaluate the linker region interactions with the catalytic GH5_34 domain and CBM6 by comparing modelled structures of enzyme variants.The models were created by entering the respective amino acid sequence for each enzyme variant and running the macro "hm_build", a script which searches PDB for suitable templates and creates a model based on the four best aligned templates.The resulting templates suggested by the script and used for all models were the three available structures of CtXyn5A (PDB 5LA2; PDB 2Y8K; PDB 5G56).All models were subjected to molecular dynamics simulation for 50 ns at 298 K and pH 6.5 using YASARA program.The cubic simulation cell was chosen to be 20 Å larger than the protein in all directions, accommodating explicit water molecules as the solvent and Na + and Cl − as counter ions.Snapshots of the simulation were stored at regular intervals of 100 ps, to investigate the systems dynamics over time.Energy calculations were performed using the AMBER14 force field, providing insights into the distribution of energy and interactions within the simulated system.RMSD and RMSF plots were generated for each simulation.The energetically minimized snapshots of the structures from the simulation were superimposed and structurally compared evaluating hydrogen bonding and πinteractions as well as generating graphical representation of structures using PyMOL 2.5 (Schrödinger).Amino acid numbering correspond to the respective enzyme tertiary structure numbering.
) indicates significant fluctuations at the C-terminal region of the linker, peaking at 5.76 Å for HhXyn5A, 4.1 Å for Hh-Ct-Ct, and 3.74 Å for Hh-Hh-Ct variant.The reduced fluctuations may explain the increase in thermostability for Ct-variants.For the CtXyn5A variants, Ct-Ct-Hh displayed similar interactions compared to the original CtXyn5A enzyme, while Ct-Hh-Hh exhibited more differences, especially concerning interactions between the linker and the CBM6 domain.The RMSF plot of the CtXyn5A domain-swapped variants reveals that the highest fluctuation, reaching 7.42 Å, occurred at the C-terminal region of the linker in Ct-Hh-Hh variant.In contrast, Ct-Ct-Hh variant exhibited lower fluctuation (of 3.98 Å), which is lower than the fluctuations observed at the corresponding site in the CtXyn5A enzyme (4.95Å fluctuation value) (Fig. 5).

Fig. 3 .
Fig. 3. HPAEC-PAD chromatograms of oligosaccharide product profiles from enzyme HhXyn5A and CtXyn5A variants after 24 h using substrates (A) wheat arabinoxylan, (B) rye arabinoxylan, and (C) alkali extracted oat fibre.The chromatograms were zoomed in and the first few minutes of the elution profiles were omitted, in order to visualize hydrolysis products of interest.Arabinoxylo-oligosaccharide standards are represented in the black curve of the chromatogram.A = arabinose; X = xylose.

Fig. 5 .
Fig. 5. Enzyme variants structure homology models accompanied by schematic representation of the domain organization as well as the corresponding RMSF plots after a 50 ns molecular dynamic simulations.A) HhXyn5A.B) Hh-Hh-Ct.C) Hh-Ct-Ct.D) CtXyn5A.E) Ct-Ct-Hh.F) Ct-Hh-Hh.Graphical representations of structures were generated using PyMOL 2.5.The linker regions in RSMF plots are indicated in red.

Table 2 .
Affinity gel electrophoresis enzyme variant migration distances and resulting MDR values.

Table 3 .
Enzyme variant characteristics.Scale of enzymatic activity is based on DNS and HPAEC-PAD results b (Norlander et al. 2023) c Based on SDS-PAGE visualization d Based on SDS-PAGE visualization (Correia et al. 2011); NA = not applicable; ND = not determined. a